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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
APPLICANT: 



(i) 

(ii) 

(iii) 
(iv) 



Nelson, Edward L. 
Nelson, Peter J. 
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m 

m 
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CO 



TITLE OF INVENTION: NOVEL VECTOR FOR 
POLYNUCLEOTIDE VACCINES 

NUMBER OF SEQUENCES: 29 

CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: MORGAN & FINNEGAN, L . L . 

(B) STREET: 345 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 
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(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 MB 

(B) COMPUTER: IBM PC COMPATIBLE* 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WORDPERFECT 5.1 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/143 06 

(B) FILING DATE: 14-AUG-1997 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US60/023931 

(B) FILING DATE: 14-AUG-1996 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: KATHRYN M. BROWN 

(B) REGISTRATION NUMBER: 34 556 

(C) REFERENCE/DOCKET NUMBER: 2026-4236US1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 
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m 0 



76 - 



(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRI PTION : SEQ ID NO:l: 

GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT 4 0 

GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 80 

GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 120 

TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 16 0 

CTTACCGGAT ACCTCTCCGC CTTTCTCCCT TCGGGAAGCG 200 

TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC 24 0 

GGTGTAGGTC GTTCGCTCCA AGCTGGG CTG TGTGCACGAA 280 

CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT 320 

ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTT AT CGCC 360 

ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG 4 00 

TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA 440 

ACTACGGCTA CAC 453 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii ) HYPOTHETICAL : No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 

GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG 40 

CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTT AC CAGT 80 

GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG 120 

GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG 160 

GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG 200 

AACGACCTAC ACCGAACTGA GATACCTACA CCGTGAGCAT 240 

TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA 280 

GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC 320 

GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT 360 

CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT 400 

TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC 440 

CAGCAACGCG GCC 453 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 



GAATTCTTTC GGACTTTTGA AAGTGATGGT GGTGGCCGAA 4 0 

GGATTCGAAC CTTCGAAGTC GATGACGGCA GATTTAGAGT 8 0 

CTGCTCCCTT TGGCCGCTCG GGAACCCCAC CACGGGTAAT 120 

GCTTTTACTG GCCTGCTCCC TTATCGGGAA GCGGGGCGCA 160 

T CAT AT CAAA TGACGCGCCG CTGTAAAGTG TTACGTTGAG 2 00 

AAAGAATTC 209 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 09 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 



GAATTCTTTC TCAACGTAAC ACTTTACAGC GGCGCGTCAT 4 0 

TTGATATGAT GCGCCCCGCT TCCCGATAAG GGAGCAGGCC 80 

AGTAAAAGCA TTACCCGTGG TGGGGTTCCC GAGCGGCCAA 120 

AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG 160 

TTCGAATCCT TCCCCCACCA CCATCACTTT CAAAAGTCCG 200 

AAAGAATTC 209 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO 



AATAAA 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETI CAL : No 

( iv) ANTI - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO 



ATTAAA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETI CAL : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO 



AGTAAA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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( i i i ) HYPOTHETI CAL : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8 



AAGAAC 



(2) INFORMATION FOR SEQ ID NO: 9: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

( iii ) HYPOTHETICAL : No 

( iv) ANTI - SENSE : No 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 



AATACA 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETI CAL : No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10 



GCCTTAAGGG CCATATGGTG AGTGGATCCC TTGACCCCAG 
GCGGGGATGG GGAGACCTGT AGTCAGAGCC CCCGGGCAGC 
ACAGG CCAAT GCCCGTCCTT CCCCTGCAGG ATGAGTAGTG 
AGTGCCTCTC CTGGCCCTGG AAGTTGCCAC TCCAGTGCCC 
ACCAGCCTTG TCCTAATAAA ATTAAGTTGC ATCATTTTGT 
CTGACTAGGT GTCCTCTATA ATATTAT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11 



ATAATATTAT AGAGGACACC TAGTCAGAAC AAATGATGCA 
ACTTAATTTT ATTAGGACAA GGCTGGTGGG CACTGGAGTG 
GCAACTTCCA GGGCCAGGAG AGGCACTCAC TACTCATCCT 
GCAGGGGAAG GACGGG CATT GGCCTGTGCT GCCCGGGGGC 
TCTGACTACA GGTCTCCCCC ATCCCCGCCT GGGGTCAAGG 
CATCCACTCA CCATATGGCC CTTAAGG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 



CCTCGGTACC TGCCATGGCG 
AGTTGGTGGA CATATTATGT 
AGCATGACAA AGTTGCAGCC 
GCCCTGGACT GTTGAACGAG 
GACACGCAAA CTGG CGGAAC 
GCGCTTTACT GGCACTTCAG 
GGCCATATGC CG 



CGGATTCTTT AT C ACTG ATA 
TTATCAGTGA TAAAGTGTCA 
GAATACAGTG ATCCGTGCCG 
GTCGGCGTAG ACGGT CTG AC 
GGTTGGGGGT GCAGCAGCCG 
GAACAAGCGG GCGCCTTAAG 



(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : No 
(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

CCTCGGTACC TGCCACCATG GCGCGGATTC TTTAT 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

( iii ) HYPOTHETICAL : No 

(iv) ANTI -SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14 
CGGCATATGG CCTTAAGGCG CCCGCTTGTT CCTGAAGT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 



GCCTTAAGGG CCATATGGTG AGTGGATGCC TTGACCCCAG 
GCGGGGATGG GGGAGACCTG TAGTCAGAGC CCCCGGGCAG 

CACAGGCCAA TGCCCGTCCT TCCCCTGCAG GATGAGTAGT 
GAGTGCCTCT CCTGGCCCTG GAAGTTGCCA CTCCAGTGCC 
CACCAGCCTT GTCCTAATAA AATTAAGTTG CATCATTTTG 
TCTGACTAGG TGTCCTCTAT AATATTAT 
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(2) 



INFORMATION FOR SEQ ID NO: 16: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) 



MOLECULE TYPE: 



cDNA 



(iii) 



HYPOTHETICAL: 



No 



(iv) 



ANTI- SENSE: 



NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



TGCCATGGCG CGGATTCTTT ATCACTGATA AGTTGGTGGA 
CATATTATGT TTATCAGTGA TAAAGTGTCA AG CATGACAA 
AGTTGCAGCC GAATACAGTG ATCCGTGCCG GCCCTGGACT 
GTTGAACGAG GTCGGCGTAG ACGGTCTGAC GACACGCAAA 
CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG GCGCTTTACT 
GGCACTTCAG GAACAAGCGG GCGCCTTAAG GGCCATATGG 
TGAGTGGATG CCTTGACCCC AGGCGGGGAT GGGGGAG AC C 
TGTAGTCAGA GCCCCCGGGC AGCACAGGCC AATGCCCGTC 
CTTCCCCTGC AGTGAGTAGT GACTGCCCGG GTGGGATCCC 
TGTGACCCCT CCCCAGTGCC TCTCCTGGCC CTGGAAGTTG 
CCACTCCAGT GCCCACCAGC CTTGTCCTAA TAAAATTAAG 
TTGCATCATT TTGTCTGACT AGGTGTCCTC TATAATATTA 
TAAGCTTGAT ATCGAATTCT TTCTCAACGT AACACTTTAC 
AGCGGCGCGT CATTTGATAT GATGCGCCCC GCTTCCCGAT 
AAGGG AG CAG GCCAGTAAAA GCATTACCCG TGGTGGGGTT 
CCCGAGCGGC CAAAGGGAGC AGACTCTAAA TCTGCCGTCA 
TCGACTTCGA AGGTTCGAAT CCTTCCCCCA CCACCATCAC 
TTTCAAAAGT CCGAAAGAAT TCCTGCAGCC CGTGTAGCCG 
TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA 
CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA 
CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG 
GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA 
CACCGAACTG AGATACCTAC AGCGTGAGCA TTGAGAAAGC 
GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT 
TCCAGGGGGA AACGCCTGGT ATCTTTATAG TCCTGTCGGG 
TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT 
CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC 
GGCCGGGGGA TCCGGAGAGC TCACT CTAGA TGAGAGAGCA 
GTGAGGGAGA GACAGAGACT CGAATTTCCG GAGCTATTTC 
AGTTTTCTTT TCCGTTTTGT GCAATTTCAC TTATGATACC 

GGCCAATGCT TGGTTGCTAT TTTGGAAACT CCCCTTAGGG 
GATGCCCCTC AACTGGCCCT ATAAAGGGCC AGCCTGAGCT 
GCAGAGGATT CCTGCAGAGG ATCAAGACAG CACGTGGACC 
TCGCACAGCC TCTCCCACAG GTACC 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1280 

1320 
1360 
1400 
1425 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



ATG AG CAAGG GCGAGGAACT GTTCACTGGC GTGGTCCCAA 4 0 

TTCTCGTGGA ACTGGATGGC GATGTGAATG GGCACAAATT 80 

TTCTGTCAGC GGAGAGGGTG AAGGTGATGC CACATACGGA 12 0 

AAGCTCACCC TG AAAT T CAT CTGCACCACT GGAAAGCTCC 16 0 

CTGTGCCATG GCCAACACTG GTCACTACCT TCACCTATGG 200 

CGTGCAGTGC TTTTCCAGAT ACCCAGACCA TATGAACGAG 24 0 

CATGACTTTT TCAAGAGCGC CATGCCCGAG GGCTATGTGC 28 0 

AGGAGAGAAC CATCTTTTTC AAAGATGACG GGAACTACAA 32 0 

GACCCGCGCT GAAGTCAAGT TCGAAGGTGA CACCCTGGTG ^ 360 

AATAGAATCG AGTTGAAGGG CATTGACTTT AAGGAAGATG 4 00 

GAAACATTCT CGGCCACAAG CTGGAATACA ACTATAACTC 44 0 

CCACAATGTG TACATCATGG CCGACAAGCA AAAGAATGGC 480 

AT CAAGGTC A ACTTCAAGAT CAGACACAAC ATTGAGGATG 520 

GATCCGTGCA GCTGGCCGAC CATTATCAAC AGAAC ACT C C 56 0 

AATCGGCGAC CGCCCTGTGC TCCTCCCAGA CAACAATTAC 6 00 

CTGTCCACCC AGTCTGCCCT GTCTAAAGAT CCCAACGAAA 640 

AGAGAGACCA CATGGTCCTG CTGGAGTTTG TGACCGCTGC 6 80 

TGGGATCACA CATGG GATGG ACGAGCTGTA CAAGTGAGC 719 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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TATGAGCAAG GGCGAGGAAC TGTTCACTGG CGTGGTCCCA 4 0 

ATTCTCGTGG AACTGGATGG CGATGTGAAT GGG CACAAAT 80 

TTTCTGTCAG CGGAGAGGGT GAAGGTGATG CCACATACGG 12 0 

AAAGCTCACC CTGAAATTCA TCTGCACCAC TGGAAAGCTC 16 0 

CCTGTGCCAT GGCCAACACT GGTCACTACC TTCACCTATG 200 

GCGTGCAGTG CTTTTCCAGA TACCCAGACC ATATGAAGCA 24 0 

GCATGACTTT TTCAAGAGCG CCATGCCCGA GGGCTATGTG 280 

CAGGAGAGAA CCATCTTTTT CAAAGATGAC GGGAACTACA 3 20 

AGACCCGCGC TGAAGTCAAG TTCGAAGGTG ACACCCTGGT 3 60 

GAATAGAATC GAGTTGAAGG GCATTGACTT TAAGGAAGAT 4 00 

GGAAACATTC TCGGCCACAA GCTGGAATAC AACTATAACT 44 0 

CCCACAATGT GTACATCATG GCCGACAAGC AAAAGAATGG 48 0 

CATCAAGGTC AACTTCAAGA TCAGACACAA CATTGAGGAT 52 0 

GGATCCGTGC AGCTGGCCGA CCATTATCAA CAGAACACTC 56 0 

CAATCGGCGA CGGCCCTGTG CTCCTCCCAG ACAACCATTA 600 

CCTGTCCACC CAGTCTGCCC GT CTAAAG AT CCCAACGAAA 64 0 

AGAGAGACCA CATGGTCCTG CTGGAGTTTG TGACCGCTGC 68 0 

TGGGATCACA CATGG CATGG ACGAGCTGTA CAAGTGAGCC 72 0 

ATATGGTGAG TGGATGCCTT GACCCCAGGC GGGGATGGGG 76 0 

G AGAC CTGTA GTCAGAGCCC CCGGGCAGCA CAGGCCAATG 800 

CCCGTCCTTC CCCTGCAGTG AGTAGTGACT GCCCGGGTGG 84 0 

GATCCCTGTG ACCCCTCCCC AGTGCCTCTC CTGGCCCTGG 880 

AAGTTGCCAC TCCAGTGCCC ACCAGCCTTG TCCTAATAAA 920 

ATTAAGTTGC ATCATTTTGT CTGACTAGGT GTCCTCTATA 960 

ATATTATAAG CTTGATATCG AATTCTTTCT CAACGTAACA 1000 

CTTTACAGCG GCGCGTCATT TGATATGATG CGCCCCGCTT 1040 

CCCGATAAGG GAGCAGGCCA GTAAAAG CAT TACCCGTGGT 1080 

GGGGTTCCCG AG CGGC CAAA GGGAGCAGAC TCTAAATCTG 112 0 

CCGTCATCGA CTTCGAAGGT TCGAATCCTT CCCCCACCAC 1160 

CATCACTTTC AAAAGTCCGA AAGAATTCCT GCAGCCCGTG 12 00 

TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC 124 0 

CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 12 80 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC 1320 

TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT 13 60 

GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 14 00 

GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA 144 0 

GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 1480 

ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 1520 

GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGT CCT 1560 

GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT 16 00 

GATGCTCGTC AGGGGGGCGG AGCCTATGG A AAAACGC CAG 164 0 

CAACGCGGCC GGGGGATCCG GAGAGCTCAC TCTAGATGAG 1680 

AGAGCAGTGA GGGAGAGACA GAGACTCGAA TTTCCGGAGC 1720 

TATTTCAGTT TTCTTTTCCG TTTTGTGCAA TTTCACTTAT 1760 

GATACCGGCC AATGCTTGGT TGCTATTTTG GAAACTCCCC 1800 

TTAGGGGATG CCCCTCAACT GGCCCTATAA AGGGCCAGCC 1840 

TGAGCTGCAG AGGATTCCTG CAGAGGATCA AGACAGCACG 1880 

TGGACCTCGC ACAGCCTCTC CCACAGGTAC C 1911 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Pro 


Asp 


Leu 


Ser 


Tyr 


Met 


Pro 


He 


Trp 


Lys 


Phe 


Pro 


1 








5 










10 






Asp 


Glu 


Glu 


Gly 


Ala 


Cys 


Gin 


Pro 


Cys 


Pro 


lie 


Asn 




15 










20 










Cys 


Thr 


His 


Ser 


Cys 


Val 


Asp 


Leu 


Asp 


Asp 


Lys 


Gly 


25 










30 










35 




Cys 


Pro 


Ala 


Glu 


Gin 


Arg 


Ala 


Ser 


Pro 


Leu 


Thr 


Ser 






40 










45 








He 


He 


Ser 


Ala 


Val 


Val 


Gly 


He 


Leu 


Leu 


Val 


Val 




50 










55 










60 


Val 


Leu 


Gly 


Val 


Val 


Phe 


Gly 


He 


Leu 









65 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Pro 


Ala 


Pro 


Gly Ala 


Gly Gly 


Met Val 


His 


His 


Arg 


1 








5 






10 






His 


Arg 


Ser 


Ser 


Ser 


Thr Arg 


Ser Gly 


Gly 


Gly 


Asp 






15 








20 








Leu 


Thr 


Leu 


Gly Leu 


Glu Pro 


Ser Glu 


Glu 


Glu 


Ala 


25 










30 






35 




Pro 


Arg 


Ser 


Pro 


Leu 


Ala Pro 


Ser Glu 


Gly 


Ala 


Gly 








40 






45 








Ser 


Asp 


Val 


Phe 


Asp 


Gly Asp 


Leu Gly 


Met 


Gly 


Ala 




50 








55 








60 


Ala 


Lys 


Gly Leu 


Ser 


Leu Pro 


Thr His 


Asp 


Pro 


Ser 










65 






70 






Pro 


Leu 


Gin Arg 


Tyr 


Ser Glu 


Asp Pro 


Thr 


Val 


Pro 






75 








80 








Leu 


Pro 


Ser 


Glu 


Thr Asp Gly 


Tyr Val 


Ala 


Pro 


Leu 


85 










90 






95 




Thr 


Cys 


Ser 


Pro 


Gin 


Pro Glu 


Tyr Val 


Asn 


Gin 


Pro 






100 






105 








Asp 


Val 


Arg 


Pro 


Pro 


Pro Ser 


Pro Arg 


Glu 


Gly 


Pro 


110 








115 








120 


Leu 


Pro 


Ala 


Ala 


Arg 


Pro Ala 


Gly Ala 


Thr 


Leu 


Glu 










125 






130 






Arg 


Pro 


Lys 


Thr 


Leu 


Ser Pro 


Gly Lys 


Asn 


Gly 


Val 






135 








140 
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Val 


Lys 


Asp 


Val 


Phe 


Aia 


rTie 


vjiy 


vjiy 


/iJ.a 


Val 


Glu 


145 






150 










ice 




Asn 


Pro 


Glu 


Tyr 
160 


Leu 


Thr 


Pro 


(j±n 


O-Ly 
165 


1 XI i 


L Y S 


C e» t~ 
Del 


Pro 


Gin 


Pro 


Glu 


Tyr 


Val 


Asn 


(j-Ln 


Pro 


Asp 


V CL _L 




170 








175 










loU 


Pro 


Gin 


Pro 


Pro 


Ser 


Pro 


Arg 


Glu 


Gly 


Pro 


Leu 


Pro 








185 










190 






Ala 


Ala 


Arg 
195 


Pro 


Ala 


Gly 


Ala 


Tnr 
200 


Leu 


Vj J-U 


Arg 


Pro 


Lys 


Leu 


Ser 


Pro 


Gly 


Lys 


Asn 


Gly 


Val 


Val 


Lys 


Asp 


205 










210 










215 


Glu 


val 


Phe 


Ala 


Phe 


Gly 


Gly 


Ala 


Val 


Glu 


Asn 


Pro 








220 








225 








Tyr 


Leu 


Thr 


Pro 


Gin 


Gly 


Gly 


Ala 


Ala 


Pro 


Gin 


Pro 


230 










235 










240 


His 


Pro 


Pro 


Pro 


Ala 


Phe 


Ser 


Pro 


Ala 


Phe 


Asp 


Asn 








245 










250 






Leu 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Pro 


Pro 


Glu 


Arg 


Gly Ala 




255 










260 










Pro 


Pro 


Ser 


Thr 


Phe 


Lys 


Gly 


Thr 


Pro 


Thr 


Ala 


Glu 


260 










270 










275 




Asn 


Pro 


Glu 


Tyr 
280 


Leu 


Gly 


Leu 


Asp 


Val 
285 


Pro 


Val 





(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 



lie lie Ser Ala Val Val Gly He Leu Leu Val Val 
15 10 

Val Leu Gly Val Val Phe Gly He Leu He 
15 20 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: cDNA 

{ i i i ) HYPOTHETICAL : No 
{ i v) ANTI - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:22: 



GCCACCATGG CCCCTGACCT CTCCTACATG CCCATCTGGA 4 0 

AGTTTCCAGA TGAGGAGGGC GCATGCCAGC CTTGCCCCAT 80 

CAACTGCACC CACTCCTGTG TGGACCTGGA TGACAAGGGC 12 0 

TGCCCCGCCG AGCAGAGAGC CAGCCCTCTG ACGTCCATCA 16 0 

TCTCTGCGGT GGTTGGCATT CTGCTGGTCG TGGT CTTGGG 200 

GGTGGTCTTT GGGATCCTCA TCAAGCGACG GCAGCAGAAG 24 0 

ATCACATGTC CAGACCCTGC CCCGGGCGCT GGGGGCATGG 2 80 

TCCACCACAG GCACCGCAGC TCATCTACCA GGAGTGGCGG 32 0 

TGGGGACCTG ACACTAGGGC TGGAGCCCTC TGAAGAGGAG 3 60 

GCCCCCAGGT CTCCACTGGC ACCCTCCGAA GGGGCTGGCT 4 00 

CCGATGTATT TGATGGTGAC CTGGGAATGG GGGCAGCCAA 440 

GGGGCTGCAA AGCCTCCCCA CACATGACCC CAGCCCTCTA 4 80 

C AG CGGTACA GTGAGGACCC CACAGTACCC CTGCCCTCTG 520 

AGACTGATGG CTACGTTGCC CCCCTGACCT GCAGCCCCCA 560 

GCCTGAATAT GTGAACCAGC CAGATGTTCG GCCCCAGCCC 600 

CCTTCGCCCC GAGAGGGCCC TCTGCCTGCT GCCCGACCTG 64 0 

CTGGTGCCAC TCTGGAAAGG CCCAAGACTC TCTCCCCAGG 680 

GAAGAATGGG GTCGTCAAAG ACGTTTTTGC CTTTGGGGGT 720 

GCCGTGGAGA ACCCCGAGAC TTGACACCCC AGGGAGGAGC 760 

TGCCCCTCAG CCCCACCCTC CTCCTGCCTT CAGCCCAGCC 800 

TTCGACAACC TCTATTACTG GGACCAGGAC CCACCAGAGC 840 

GGGGGGCTCC ACCCAGCACC TTCAAAGGGA CACCTACGGC 880 

AGAGAACCCA GAGTACCTGG GTCTGGACGT GCCAGTGTGA 92 0 

AGCCTTAAGG GCCATATGGT GAGTGGATGC CTTGACCCCA 960 

GGCGGGGATG GGGGAGACCT GTAGTCAGAG CCCCCGGGCA 1000 

GCACAGGCCA ATGCCCGTCC TTCCCCTGCA GTGAGTAGTG 1040 

ACTGCCCGGG TGGGATCCCT GTGACCCCTC CCCAGTGCCT 1080 

CTCCTGGCCC TGGAAGTTGC CACTCCAGTG CCCACCAGCC 1120 

TTGTCCTAAT AAAATTAAGT TGCATCATTT TGTCTGACTA 1160 

GGTGTCCTCT ATAATATTAT AAGCTTGATA TCGAATTCTT 1200 

TCTCAACGTA ACACTTTACA GCGGCGCGTC ATTTGATATG 1240 

ATGCGCCCCG CTTCCCGATA AGGGAGCAGG CCAGTAAAAG 1280 

CATTACCCGT GGTGGGGTTC CCGAGCGGCC AAAGGGAGCA 1320 

GACTCTAAAT CTGCCGTCAT CGACTTCGAA GGTTCGAATC 1360 

CTTCCCCCAC CACCATCACT TTCAAAAGTC CGAAAGAATT 14 00 

CCTGCAGCCC GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1440 

AACT CTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC 1480 

TGTTACCAGT GGCTGCTGCC AGTGG CGATA AGTCGTGTCT 1520 

TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 1560 

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 1600 

GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 1640 

GCGTGAGCAT TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1680 

AAGG CGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 1720 

GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 1760 

TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 1800 

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 1840 

GGAAAAACGC CAGCAACGCG GCCGGGGGAT CCGGAGAGCT 1880 

CACTCTAGAT GAGAGAGCAG TGAGGGAGAG ACAGAGACTC 1920 

GAATTTCCGG AGCTATTTCA GTTTTCTTTT CCGTTTTGTG 1960 
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CAATTT C ACT TATGATACCG GCCAATGCTT GGTTGCTATT 
TTGGAAACTC CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA 
TAAAGGGCCA GCCTGAGCTG CAGAGGATTC CTGCAGAGGA 
TCAAGACAGC ACGTGGACCT CGCACAGCCT CTCCCACAGG 
TACCT 



(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: oligonucleotide 
( i i i ) HYPOTHETI CAL : No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:23: 
GTCTGCCACC ATGGCCTACT CCCCTGC 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: oligonucleotide 
( i i i ) HYPOTHETI CAL : No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 
TTCTTTGGTG ACCTACCTCT TCGGAATTGC CGAGTC 
(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



448026 1 



- 89 - 

(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : No 
( i v) ANT I - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:25: 



ATGGAGG AG C CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC 4 0 

TGAGTCAGGA AACATTTTCA GACCTATGGA AACTACTTCC 8 0 

TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 12 0 

GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT 16 0 

TCACTGAAGA CCCAGGTCCA GATGAAGCTC CCAGAATGCC 2 00 

AGAGGCTGCT CCCCGCGTGG CCCCTGCACC AGCAGCTCCT 24 0 

ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT 28 0 

CATCTTCTGT CCCTTCCCAG AAAACCTACC AGGGCAGCTA 32 0 

CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 3 60 

TCTGCCACCA TGGCCTACTC CCCTGCGTCT GTGACTTGCA 4 00 

CGTACTCCCC TGCCCTCAAC AAGATGTTTT GCCAACTGGC 44 0 

CAAGACCTGC CCTGTGCAGC TGTGGGTTGA TTCCACACCC 4 80 

CCGCCCGGCA CCCGCGTCCG CGCCATGGCC ATCTACAAGC 520 

AGT CACAGCA CATGACGGAG GTTGTGAGGC GCTGCCCCCA 56 0 

CCATGAGCGC TG CT C AG ATA GCGATGGTCT GGCCCCTCCT 6 00 

CAGCGTCTTA TCCGAGTGGA AGGAAATTTG CGTGTGGAGT 64 0 

ATTTGGATGA CAGAAACACT TTT CGACATA GTGTGGTGGT 680 

GCCCTATGAG CCGCCTGAGG TTGGCTCTGA CTGTACCACC 72 0 

ATCCACTACA ACTACATGTG TAACAGTTCC TGCATGGGCG 76 0 

GCATGAACCG GAGGCCCATC CTCACCATCA TCACACTGGA 800 

AGACTCCAGT GGTAATCTAC TGGGACGGAA CAGCTTTGAG 84 0 

GTGCGTGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG 8 80 

AGGAAGAGAA TCTCCGCAAG AAAGGGGAGC CTCACCACGA 92 0 

GCTGCCCCCA GGGAG CACT A AGCGAGCACT GCCCAACAAC 960 

ACCAGCTCCT CTCCCCAGCC AAAGAAGAAA CCACTGGATG 100 0 

GAGAATATTT CACCCTTCAG ATCCGTGGGC GTGAGCGCTT 104 0 

CGAGATGTTC TTTGGTGACC TACCTCTTCG GAATTGCCGA 108 0 

GTCTTCCGAG AG CTG AATGA GGCCTTGGAA CTCAAGGATG 1120 

CCCAGGCTGG GAAGGAGCCA GGGGGGAG C A GGGCTCACTC 1160 

CAGCCACCTG AAGTCCAAAA AGGGTCAGTC TACCTCCCGC 1200 

CATAAAAAAC TCATGTTCAA GACAGAAGGG CCTGACTCAG 124 0 

AC 1242 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 
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(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CTCGGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 40 

CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG 80 

TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC 120 

CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG TTCCGACCCT 160 

GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 200 

AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA 240 

GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA 280 

CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 3 20 

AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT 3 60 

CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 400 

GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 440 

CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT 480 

GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG 520 

TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 560 

GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA 600 

AAGGATCT 608 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHETI CAL : No 
(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:27: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA TCACTGATAA 4 0 

GTTGGTGGAC ATATTATGTT TAT CAGTGAT AAAGTGTCAA 80 

GCATGACAAA GTTGCAGCCG AATACAGTGA TCCGTGCCGG 120 

CCCTGGACTG TTGAACGAGG TCGGCGTAGA CGGTCTGACG 160 

ACACGCAAAC TGGCGGAACG GTTGGGGGTG CAGCAGCCGG 2 00 

CGCTTTACTG GCACTTCAGG AACAAGCGGG CGCCTTAAGG 240 

GCCATATGGT GAGTGGATGC CTTGACCCCA GGCGGGGATG 2 80 

GGGGAGACCT GTAGTCAGAG CCCCCGGGCA GCACAGGCCA 320 

ATGCCCGTCC TTCCCCTGCA GGATGAGTAG TGAGTGCCTC 3 60 

TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 4 00 

TGTCCTAATA AAATTAAGTT GCATCATTTT GTCTGACTAG 4 40 

GTGTCCTCTA TAATATTATA AGCTTGATAT CGAATTCTTT 4 80 

CGGACTTTTG AAAGTGATGG TGGTGGGGGA AGGATTCGAA 520 

CCTTCGAAGT CGATGACGGC AGATTTAGAG TCTGCTCCCT 560 

TTGGCCGCTC GGGAACCCCA CCACGGGTAA TGCTTTTACT 600 

GGCCTGCTCC CTTATCGGGA AGCGGGGCGC ATCATATCAA 640 

ATGACGCGCC GCTGTAAAGT GTTACGTTGA GAAAGAATTC 6 80 

CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 720 
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GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 760 

GAGGTGGCGA AACCCGACAG GACTATAAAG AT AC CAGGCG 800 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 840 

CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 880 

GGG AAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGG TAT 920 

CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 960 

TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 1000 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC 104 0 

TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGGA 1080 

GAG CG AGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 1120 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 1160 

ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 1200 

TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 1240 

CGGTGGTTTT TTTGTTTGCA AGCAG GAG AT TACG CGCAGA 1280 

AAAAAAGGAT CTGGGGGATC CGGAGAGCTC ACTCTAGATG 1320 

AG AG AG CAGT GAGGGAGAGA CAGAGACTCG AATTTCCGGA 1360 

GCTATTTCAG TTTTCTTTTC CGTTTTGTGC AATTTCACTT 14 00 

ATGATACCGG CCAATGCTTG GTTGCTATTT TGGAAACTCC 1440 

CCTTAGGGGA TGCCCCTCAA CTGGCCCTAT AAAGGGCC AG 1480 

CCTGAGCTGC AGAGGATTCC TGCAGAGGAT CAAGACAGCA 1520 

CGTGGACCTC GCACAGCCTC TCCCACA 1547 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA TCACTGATAA 40 

GTTGGTGGAC ATATTATGTT TATCAGTGAT AAAGTGTCAA 80 

GCATGACAAA GTTGCAGCCG AATACAGTGA TCCGTGCCGG 120 

CCCTGGACTG TTGAACGAGG T CGG CGTAG A CGGTCTGACG 160 

ACACG CAAAC TGGCGGAACG GTTGGGGGTG CAGCAGCCGG 200 

CGCTTTACTG GCACTTCAGG AACAAGCGGG CGCCTTAAGG 24 0 

GCCATATGGT GAGTGGATGC CTTGACCCCA GG CGGGGATG 280 

GGGGAGACCT GTAGTCAGAG CCCCCGGGCA GCACAGGCCA 320 

ATGCCCGTCC TTCCCCTGCA GGATGAGTAG TGAGTGCCTC 360 

TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 400 

TGTCCTAATA AAATTAAGTT GCAT CATTTT GTCTGACTAG 440 

GTGTCCTCTA TAATATTATA AGCTTGATAT CGAATTCTTT 4 80 

CGGACTTTTG AAAGTGATGG TGGTGGGGGA AGGATTCGAA 520 

CCTTCGAAGT CGATGACGGC AGATTTAGAG TCTGCTCCCT 560 

TTGGCCGCTC GGG AACCCCA CCACGGGTAA TGCTTTTACT 600 

GGCCTGCTCC CTTAT CGGGA AGCGGGGCGC ATCATATCAA 640 

ATGACGCGCC GCTGTAAAGT GTTACGTTGA GAAAGAATTC 680 
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CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC 720 

GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 760 

GAGGTGGCGA AAC CCGACAG GACTATAAAG ATACCAGGCG 800 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA 840 

CCCTGCCGCT TAC CGGATAC CTGTCCGCCT TTCTCCCTTC 88 0 

GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT 920 

CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG 960 

TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 1000 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC 104 0 

TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 1080 

GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 1120 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT 1160 

ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 1200 

TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 124 0 

CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA 128 0 

AAAAAAGGAT CTGGGGG AT C CGGAGAGCTC CCAACGCGTT 1320 

GGATGCATGG ATGAGGGAAA GGAGGTAAGA TCTGTAATGA 136 0 

ATAAGCAGGA ACTTTGAAGA CTCAGTGACT CAGTGAGTAA 1400 

TAAAGACTCA GTGACTTCTG ATCCTGTCCT AACTGC CACT 144 0 

CCTTGTTGTC CCAAGAAAGC GGCTTCCTGC TCTCTGAGGA 14 8 0 

GGACCCCTTC CCTGGAAGGT AAAACTAAGG ATGT CAGCAG 152 0 

AGAAATTTTT CCACCATTGG TGCTTGGTCA AAGAGGAAAC 156 0 

TGATGAGCTC ACTCTAGATG AGAGAGCAGT GAGGGAGAGA 160 0 

CAGAGACTCG AATTTCCGGA GCTATTTCAG TTTTCTTTTC 164 0 

CGTTTTGTGC AATTTCACTT ATGATACCGG CCAATGCTTG 1680 

GTTGCTATTT TGGAAACTCC CCTTAGGGGA TGCCCCTCAA 1720 

CTGGCCCTAT AAAGGGCCAG CCTGAGCTGC AGAGGATTCC 176 0 

TGCAGAGGAT CAAGACAGCA CGTGGACCTC GCACAGCCTC 1800 
TCCCACA 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



GGTACCTGCC ACCATGGCGA AGGGCGAGGA ACTGTTCACT 40 

GGCGTGGTCC CAATTCTCGT GGAACTGGAT GGCGATGTGA 80 

ATGGGCACAA ATTTTCTGTC AGCGGAGAGG GTGAAGGTGA 12 O 

TGCCACATAC GGAAAGCTCA CCCTGAAATT CATCTGCACC 160 

ACTGGAAAGC TCCCTGTGCC ATGGCCAACA CTGGTCACTA 2 00 

CCTTCACCTA TGGCGTGCAG TGCTTTTCCA GATACCCAGA 240 

CCATATGAAG CAGCATGACT TTTTCAAGAG CGCCATGCCC 280 
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GAGGGCTATG TG C AGG AG AG AACCATCTTT TTCAAAGATG 32 0 

ACGGGAACTA CAAGACCCGC GCTGAAGTCA AGTTCGAAGG 3 60 

TGACACCCTG GTGAATAGAA TCGAGTTGAA GGGCATTGAC 400 

TTTAAGGAAG ATGGAAACAT TCTCGGCCAC AAGCTGGAAT 44 0 

ACAACTATAA CTCCCACAAT GTGTACATCA TGGCCGACAA 4 80 

GCAAAAGAAT GGCATCAAGG TCAACTTCAA GATCAGACAC 52 0 

AACATTGAGG ATGGATCCGT GCAGCTGGCC GACCATTATC 56 0 

AACAGAACAC TCCAATCGGC GACGGCCCTG TGCTCCTCCC 6 00 

AGACAACCAT TACCTGTCCA CCCAGTCTGC CCTGTCTAAA 64 0 

GATCCCAACG AAAAGAGAGA CCACATGGTC CTGCTGGAGT 6 80 

TTGTGACCGC TGCTGGGATC ACACATGGCA TGGACGAGCT 72 0 

GTACAAGTGA GCGCCTTAAG GGCCATATGG TGAGTGGATG 76 0 
CCTTGACCCC AGG CGGGG AT GGGGGAGACC TGTAGTCAGA 8 00 

GCCCCCGGGC AGCACAGGCC AATGCCCGTC CTTCCCCTGC 84 0 

AGGATGAGTA GTGAGTGCCT CTCCTGGCCC TGGAAGTTGC 88 0 

CACTCCAGTG CCCACCAGCC TTGTCCTAAT AAAATTAAGT 92 0 

TGGATCATTT TGTCTGACTA GGTGTCCTCT ATAATATTAT 960 

AAGCTTGATA TCGAATTCTT TCGGACTTTT GAAAGTGATG 10 0 0 

GTGGTGGGGG AAGGATTCGA ACCTTCGAAG TCGATGACGG 104 0 

CAGATTTAGA GTCTGCTCCC TTTGGCCGCT CGGGAACCCC 1080 

ACCACGGGTA ATGCTTTTAC TGGCCTGCTC CCTTATCGGG 112 0 

AAGCGGGGCG CAT C AT ATCA AATGACGCGC CGCTGTAAAG 116 0 

TGTTACGTTG AGAAAGAATT CCTGCAGCCC GCCGCGTTGC 12 0 0 

TGG CGTTTTT CCATAGGCTC CGCCCCCCTG ACGAGCATCA 124 0 

C AAAAAT CG A CGCTCAAGTC AGAGGTGGCG AAACCCGACA 128 0 

GGACTATAAA G AT AC CAGGC GTTTCCCCCT GGAAGCTCCC 132 0 

TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 13 6 0 

CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT 14 0 0 

CAATGCTCAC GCTGTAGGTA TCTCAGTTCG GTGTAGGTCG 14 4 0 

TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 14 8 0 

GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG 152 0 

TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG 156 0 

CCACTGGTAA CAGGATTAGC AG AG CGAGGT ATGTAGGCGG 16 00 

TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTACGGCTAC 164 0 

ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC 16 80 

CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG 1720 

CAAACAAACC ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC 1760 

AAG C AG C AG A TTACGCGCAG AAAAAAAGGA TCTGGGGGAT 18 00 

CCGGAGAGCT CCCAACGCGT TGGATGCATG GATGAGGGAA 184 0 

AGGAGGTAAG ATCTGTAATG AATAAGCAGG AACTTTGAAG 18 80 

ACT C AGTG AC TCAGTGAGTA ATAAAGACTC AGTGACTTCT 1920 

GATCCTGTCC TAACTGCCAC TCCTTGTTGT CCCAAGAAAG 1960 

CGGCTTCCTG CTCTCTGAGG AGGACCCCTT CCCTGGAAGG 2000 

TAAAACTAAG GATGTCAGCA GAGAAATTTT TCCACCATTG 2 04 0 

GTGCTTGGTC AAAGAGGAAA CTGATGAGCT CACTCTAGAT 2080 

GAGAGAGCAG TGAGGGAGAG ACAGAGACTC GAATTTCCGG 2120 

AGCTATTTCA GTTTT CTTTT CCGTTTTGTG CAATTTCACT 2160 

TATGATACCG GCCAATGCTT GGTTGCTATT TTGGAAACTC 22 00 

CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA TAAAGGG CCA 224 0 

GCCTGAGCTG CAGAGGATTC CTGCAGAGGA TCAAGACAGC 2280 

ACGTGGACCT CGCACAGCCT CTCCCACA 23 08 
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